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THE SPINDLY GENE , METHODS OF IDENTIFICATION AND USE 



Field of the Invention 

This invention relates to the regulation of 
5 plant growth and to the identification and use of a 
gene affecting the response to gibberellins . 



Background of the Invention 

Gibberellins (GAs) are a major class of 

10 plant hormones controlling many developmental 

processes including seed development and germination, 
flower and fruit development, and flowering time. 
Gibberellins (GAs) play a role in a number of growth 
and developmental processes in plants. Severe GA- 

15 deficient mutants may exhibit reduced seed 

germination, dwarfism of virtually all organs, and 
aberrant flower, fruit and seed development. Although 
the GA biosynthetic pathway has been studied for some 
time little is known about GA perception or signal 

20 transduction. 

Genetic analysis has uncovered two classes 
of mutants which are affected in their response to 
gibberellins. One class consists of dominant or semi- 
dominant mutants resembling gibberellin-def icient 

25 mutants but exhibiting reduced or no sensitivity to 

gibberellin. Mutants of this type have been isolated 
in maize, wheat and Arabidopsis thaliana 
(arabidopsis) . 

The second group, the "slender" mutants, 

30 have a recessive over-growth phenotype that is 

phenocopied by repeated treatments of wild type plants 
with gibberellin, and is consistent with a defect 
causing constitutive gibberellin response. These 
mutants have been studied in pea, tomato, barley and 

35 arabidopsis (Jacobsen, S.E. & Olszewski, N.E. (1993) 
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The Plant Cell 5:887-896). Slender mutants in pea, 
tomato and barley contain lower endogenous levels of 
gibberellins than wild-type plants. 

A slender phenotypic mutation in arabidopsis 
5 is termed the "spindly" or "spy" mutation. The 
spindly mutation is characterized by elongated 
petioles, yellow-green leaves, early flowering, long 
spindly bolts, partial male sterility and parthencarpic 
fruit development. These phenotypes are also observed 

10 in wild type plants exhibiting a gibberellin overdose 
syndrome due to external applications of gibberellin 
(Jacobsen, et al. supra). 

Spy phenotypic mutants were isolated from M2 
populations of ethyl methylsulf onate (EMS) -mutagenized 

15 wild-type seeds by selection for germination in the 
presence of the gibberellin biosynthesis inhibitor, 
paclobutrazol (Jacobsen, et al. supra). The 
phenotypes of the plants were typical of spy mutants 
and all of these phenotypes are observed, at some 

20 level, in wild type plants which have been repeatedly 
sprayed with GA3 . 

Growth retardants, typically in the form of 
sprays or washes are widely used, particularly in 
greenhouses to control plant growth. These chemicals 

25 act by reducing gibberellin levels of the plant. 

Direct treatment of plants with gibberellins or their 
analogs increase plant size over the entire treated 
area of the plant. While spindly and slender 
phenotypes have been described as early as 1922, the 

30 gene or genes associated with these phenotypes remain 
ellusive. Identification of the gene or genes 
associated with these phenotypes would permit 
gibberellin expression to be controlled without the 
need for chemical treatment. This will reduce 
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chemical usage and minimize a potential environmental 
hazard . 

Summary of the Invention 

5 This invention relates to the identification 

and use of a novel gene controlling plant growth and 
development by affecting responses to gibberellins . 

In one aspect of this invention an isolated 
nucleic acid fragment is claimed that comprises at 

10 least nucleic acids 2175-2405 of SEQ ID NO:l. The 

invention contemplates a vector comprising a promoter 
and a nucleic acid fragment comprising at least 
nucleic acids 2175-2405 of SEQ ID NO:l, cells 
containing the vector and transgenic plants expressing 

15 protein encoded by an exogenously derived nucleic acid 
fragment comprising at least nucleic acids 2175-2405 
of SEQ ID NO:l as well as transgenic plants encoding 
an exogenously-derived nucleic acid fragment 
comprising at least nucleic acids 2175-2405 of SEQ ID 

20 NO:l and a transgenic plant expressing a polypeptide 
encoded by an exogenously-derived nucleic acid 
fragment comprising at least amino acids 609-685 of 
SEQ ID NO:2. In a preferred embodiment the promoter 
is plant tissue specific and plant tissue specific 

25 vectors include promoters specific for fruit 
expression and for green-tissue expression. 

In another embodiment of the invention a 
transgenic plant is claimed that expresses a 
polypeptide where the polypeptide is encoded by an 

30 exogenously-derived nucleic acid fragment capable of 
hybridizing under highly stringent hybridization 
conditions to an isolated nucleic acid fragment 
comprising at least nucleic acids 2175-2405 of SEQ ID 
NO: 1 . 
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In another embodiment of the invention an 
isolated nucleic acid fragment is contemplated that 
contains the tetratricopeptide repeat region and 
comprises nucleic acids 579-1640 of SEQ ID N0:1. 
5 Vectors comprising a promoter and the 

tetratricopeptide containing region (nucleic acids 
579-1640 of SEQ ID NO : 1 ) are also contemplated along 
with cells containing the vector and transgenic plants 
expressing protein encoded by an exogenously-derived 

10 nucleic acid fragment comprising at least nucleic 

acids 579-1640 of SEQ ID NO:l. Isolated nucleic acid 
encoding the polypeptide of SEQ ID NO: 2 is also a part 
of this invention. The term exogenously-derived 
refers herein to nucleic acid fragments introduced 

15 into a plant that may or may not now be incorporated 
into the plant genome through methods of 
transformation that are known in the art. For 
example, a nucleic acid fragment is exogenously- 
derived if it was incorporated into a vector and is 

20 introduced into a plant irrespective of whether or not 
the plant contains an endogenous (its own) copy of the 
nucleic acid fragment. 

In another aspect of this invention a method 
is disclosed for identifying a gene from a plant 

25 comprising the steps of identifying at least a first 

clone from a cDNA library from a plant that hybridizes 
under low stringency hybridization conditions to a 
probe comprising nucleic acids 2175 to 2405 of SEQ ID 
NO:l, wherein the clone contains at least a portion of 

30 an open reading encoding a polypeptide with a C- 

terminus having at least 50% amino acid homology to 
amino acids 609-685 of SEQ ID NO:2, obtaining a full- 
length clone, wherein the full-length clone has about 
100% nucleic acid homology to the open reading frame 

35 of said first clone and determining if the full length 



4 



WO 97/43419 



PCT/US97/08765 



clone contains at least one te t rat ricopept ide repeat 
region. The determining step additionally may 
comprise determining if the tet ratricopeptide repeat 
region of the full-length clone hybridizes under 
5 stringent hybridization conditions to a nucleic acid 
fragment consisting of nucleotides 579-1640 of SEQ ID 
NO : 1 . 

The invention also relates to an isolated 
recombinant gene selected by the method of identifying 

10 at least a first clone from a cDNA library from a 
plant that hybridizes under low stringency 
hybridization conditions to a probe comprising nucleic 
acids 2175-2405 of SEQ ID NO : 1 , wherein the clone 
contains at least a portion of an open reading frame 

15 encoding a polypeptide with a C-terminus having at 

least 30% amino acid homology to amino acids 609-685 
of SEQ ID NO:2, obtaining a full-length clone, wherein 
the full-length clone has about 100% nucleic acid 
homology to the open reading frame of the first clone 

20 and determining if the full length clone contains at 

least one tetrat ricopeptide repeat region. Preferably 
the open reading frame of the gene has at least a 70% 
homology to SEQ ID NO:l 

An isolated gene from a plant is disclosed 

25 where the gene encodes a polypeptide where a portion 
of the polypeptide has a 30% amino acid homology to 
amino acids 609-685 of SEQ ID NO:2, wherein the 
polypeptide encoded by the gene includes a 
tetrat ricopept ide repeat region and wherein 

30 introduction of a vector directing expression of the 
gene into a plant produces a transgenic plant with a 
spindly phenotype. 

The invention also relates to a method for 
identifying a gene from a plant comprising the steps 

35 of searching a gene database for a nucleic acid 
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sequence encoding a polypeptide from a plant and 
having at least 30% amino acid homology to the amino 
acid fragment corresponding to amino acids 609-685 of 
SEQ ID NO: 2 and determining whether the open reading 
5 frame includes a tetratricopeptide repeat region. 

The following definitions are employed 
throughout this disclosure. 

In eukaryotes, RNA polymerase II catalyzes 
the transcription of a structural gene to produce 

10 mRNA. A DNA molecule can be designed to contain an 
RNA polylmerase II template in which the RNA 
transcript has a sequence that is complementary to 
that of a specific mRNA . The RNA transcript is termed 
an antisense RNA and a DNA sequence that encodes the 

15 antisense RNA is termed an antisense gene. Antisense 
RNA molecules are capable of binding to mRNA 
molecules, resulting in an inhibition of mRNA 
translation * 

The term cDNA library refers to a collection 

20 of nucleic acid fragments originally derived from 
reverse transcription of isolated mRNA. The cDNA 
library contains a variety of different clones with 
different sequences. The individual cDNA sequences 
may or may not contain open reading frames and they 

25 may or may correspond to a full length mRNA transcipt. 

A sibling plant refers to plants derived 
from seeds having a common parent. 

The tetratricopeptide repeat region of this 
invention refers to amino acids 77-430 of SEQ ID NO:2 

30 and to sequences having tetratricopeptide structural 
domains as enumerated in this disclosure. 

An expression vector is a DNA molecule 
comprising a gene that is expressed in a host cell. 
Typically, gene expression is placed under the control 

35 of certain regulatory elements, including constitutive 



or inducible promoters/ tissue-specific regulatory 
elements and enhancers. Such a gene is said to be 
"operably linked to" the regulatory elements 

A cloning vector is a DNA molecule, such as 
a plasmid, cosmid or bacteriophage that has the 
capability of replicating autonomously in a host cell. 
Cloning vectors typically contain one or a small 
number of restriction endonuclease recognition sites 
at which foreign DNA sequences can be inserted in a 
determinable fashion without loss of an essential 
biological function of the vector, as well as a marker 
gene that is suitable for use in the identification 
and selection of cells transformed with the cloning 
vector. Marker genes typically include genes that 
provide antibiotic resistance. 

Exogenous denotes some item that is foreign 
to its surroundings and particularly applies to a 
class of genetic contructs that is not found in the 
normal genetic complement of the host plant. The term 
"clone" refers to the exogenous insert that has been 
inserted into the cloning vector, here a SPY gene or a 
portion thereof. 

Heterologous is a noun modifier indicating a 
source that is different. For example, a heterologous 
promoter used with a structural gene of the present 
invention is a promoter that is different from that of 
the structural gene. 

An isolated recombinant nucleic acid 
fragment refers to RNA or DNA that is not originally 
from the genomic DNA of the particular organism into 
which it is introduced. The isolated fragment can be 
used as a probe, incorporated into an expression 
vector or otherwise used in a variety of molecular 
biology methods. 
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Two nucleic acid molecules are considered to 
have a substantial sequence similarity if their 
nucleotide sequences share a similarity of at least 
50% and protein encoded by the nucleotide sequences 
5 have a homology of at least 30%. Sequence similarity 
determinations can be performed using the FASTA 
program (Genetics Computer Group Madison, WI). 
Alternatively, sequence similarity determinations can 
be performed using BLASTP (Basic Local Alignment 

10 Search Tool) of the Experimental GENINFO Blast Network 
Service. See also Pasternak, et al . Methods in Plant 
Molecular Biology and Biotechnology, Glick, et al. 
(eds.), pages 251-267 ( Press, 1993). 

Stringent hybridization conditions are 

15 understood in this disclosure as those nucleic acid 

hybridization conditions normally used by one of skill 
in the art to establish at least a 90% homology 
between complementary pieces of DNA or between DNA and 
RNA. In this disclosure 90% homology is preferred for 

20 hybridization of the C-terminus fragment in screening 
strategies to identify other SPY genes. Lesser 
homologies using lower stringency conditions may be 
desired such as at least 50% homology or preferably at 
least 80% homology. 

25 A suitable promoter is a promoter that 

controls gene expression in cells that are to be 
altered developmentally by the manipulation of genes 
controlling response to gibberellin. 

A transgenic plant is plant having one or 

30 more plant cells that contain an expression vector. 

Brief Description of -the Figures 



Figure 1 is a schematic detailing the 
35 structure of the SPY gene and spy mutations. Broken 
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lines represent the region outside the 3.5 kbp cDNA. 
Unbroken lines represent introns. Solid bars 
represent exons. Introns and exons were identified in 
the genomic sequence by comparing the sequence of the 
5 genomic clone with that of the cDNA. The open bar 
represents the exon that is skipped in spy-1 and 
spy-2. Arrows indicate mutated nucleic acid residues. 
The triangle represents the T-DNA insertion. 

Figure 2 is the deduced amino acid sequence 

10 of the SPY protein (SEQ ID NO:2) . The protein is 

shown in three blocks, the N-terminus (residues 1-76) , 
the 10 tet ra tricopeptide repeats (TPRs, residues 77- 
4 30) aligned with each other, and the C-terminus 
(residues 431-914). Below the TPR alignment is a 

15 consensus sequence: amino acids are shown if present 
in at least 5 of the SPY TPRs. Residues indicated 
with a double underline have the following alterations 
in the mutant alleles: Met 354 to Gin 376 are absent in 
spy-1 and spy-2, Gly 593 is converted to Ser in spy-3, 

20 and Cys 845 is converted to Tyr in spy-5. Abbreviations 
for the amino acid residues are A, Ala; C, Cys; D, 
Asp; E, Glu; F, Phe; G , Gly; H, His; I, lie; K, Lys; 
L, Leu; M, Met; N , Asn; P, Pro; Q, Gin; R, Arg; S, 
Ser; T, Thr; V, Val; W, Trp; and Y, Tyr. 

25 Figure 3 is the sequence of the SPY cDNA 

(SEQ ID NO:l). The start codon for translation, (ATG) 
and the stop codon (TAG) are indicated by the double 
underline. The single underline indicates the region 
deleted in mRNA of spy-1 and spy-2. Nucleotides 

30 mutated in spy-1, spy-3 and spy-5 are indicated in 
bold and the nucleotide of the mutant is indicated 
below . 

Figure 4 is a sequence of the SPY genomic 
region (SEQ ID NO: 3). The bold G and C which are also 
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underlined correspond to the first and last 
nucleotides of the SPY cDNA . 

Figure 5 is a Blast Alignment of the SPY 
protein sequences with a translation of a maize cDNA. 
5 A BLAST alignment of a portion of the SPY protein (SPY 
Prot) and a translation of a maize cDNA (Maize) . The 
consensus of this alignment are indicated between the 
SPY and maize sequences (consensus). The statistics 
provided by the BLAST program are included below the 
10 alignment. 

Figure 6 provides the maps of 3850:1003 T- 
DNA, as the T-DNA existed in spy-4 plants and the 
plasmid rescued from spy-4 . Figure 6A illustrates the 
structure of the 3850:1003 T-DNA . Restriction sites 

15 are indicated above the map and the sizes of different 
segments of the T-DNA are indicated below the map. 
Figure 6B indicates the highly deleted form of the T- 
DNA as it exists in the spy-4 genome. Figure 6C 
indicates the structure of the construct obtained by 

20 plasmid rescue employing the restriction endonuclease 
Sacl. The thin line on the left side of the T-DNA 
corresponds to the arabidopsis sequences used to probe 
genomic libraries constructed in pOCA18 to obtain the 
SPY gene. The probe extended from the Sacl site at 

25 the left end of the map to the PvuII site near the 

junction between the arabidopsis and pBR322 sequences. 

Detailed Description of the Praforred Embodiments 

The identification of the genes involved in 
gibberellin signal transduction and regulation have a 
30 number of commercial applications. For example, 

overproduction of gibberellin results in increased 
plant growth while underproduction of gibberellin 
reduces plant growth. Control of gibberellin levels 
provides a mechanism for controlling plant development 
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including plant height, fruit growth, flower 
development, leaf size, and the like. In maize, 
higher levels of GA result in increased vigor in 
hybrid maize (Rood, et al. (1988) Science 241:1216- 
5 1218) . 

The present invention discloses the 
identification and use of a gene that is involved in 
gibberellin signal transduction. Inactivation of the 
gene produces a spindly phenotype and introduction of 
10 the gene into plants rescues the spindly phenotype. 
Methods are disclosed for isolating the gene in a 
variety of plant types and using the gene to effect 
changes in the level of gibberellin expression. 

Slender mutations in plants have a number of 
15 similar phenotypic characteristics. In arabidopsis, 
the slender mutant phenotypes are characterized as 
producing plants that are physically similar to plants 
that are repeatedly treated with gibberellins . 
Although slender phenotypes have been identified, the 
20 relationship of the phenotype to one or more genes has 
not been understood. In previous studies a plant 
growth regulator, paclobutrazol , was used to identify 
mutants with altered gibberellin synthesis pathways. 
Paclobutrazol inhibits the monooxygenases involved in 
25 the oxidation of ent-kaurene to en t- kaurenoic acid and 
reduces the plant's ability to synthesize active 
gibberellins . 

Three spy mutants were identified that 
demonstrated resistance to paclobutrazol treatment 
30 (Jacobsen, S.E., et al . (1993) The Plant Cell 5:887- 
896) . The gene or genes conferring the phenotype 
could not be predicted because the location of the 
gene was not known. 

To identify the location of the spindly 
35 (spy) mutant, a spy phenotypic mutant was identified 

11 
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by first isolating a mutant expressing the spy 
phenotype from a library of more than 4,000 
Agrobacterium-mediiatedi seed transformation lines of 
arabidopsis having T-DNA insertions positioned 
5 throughout the genome (Arabidopsis Biological Resource 
Center (stock number CS3115 Columbus, Ohio) and was 
originally produced by K.Feldman {The Plant Journal 
1:71-82, 1991). The T-DNA library was produced by 
inoculating germinating seeds with Agrobacter ium 

10 containing 3850:1003 T-DNA and selecting progeny that 
are kanamycin resistant. (Feldman supra) 

The T-DNA insert includes an engineered 
plant selectable marker that confers resistance to 
kanamycin. A kanamycin resistance gene functions in 

15 E. coll and Agrojbacteriu/n tumefaciens . In addition 

the T-DNA includes two copies of pBR322 that contains 
genes conferring bacterial resistance to ampicilin and 
tetracycline. A detailed discussion of the T-DNA 
insert is provided in Feldman (supra) and will not be 

20 detailed here. 

Seeds from the library were selected for 
their ability to germinate in the presence of 
paclobutrazol (Jacobsen, supra). Pool number CS2635 
contained two paclobutrazol resistant seeds <spy-4) 

25 that yielded plants with a spy phenotype. Similar to 
the spy alleles (Jacobsen, supra) , spy-4 flowered 
earlier than wild type and displayed pale green 
foliage, partial male sterility and parthenocarpic 
fruit development. Other phenotypic details of spy-4 

30 are provided in Example 1. 

To determine whether or not the spy 
phenotype is produced from one or more genes, spy-4 
was back crossed to a progenitor Ws plant. Of 107 F2 
seedlings grown on IX MS plates (Sigma Chemical, St. 

35 Louis, MO) supplemented with 1% (w/v) sucrose in 1% 
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(w/v) agar containing 50 pg/ml kanamycin (kan). 80 of 
the seedlings were kan resistant. This suggested the 
presence of a single kan locus. Of 43 paclobutrazol 
resistant seedlings in the F2 population, all were kan 
5 resistant. In contrast, only two thirds of the non- 
paclobutrazol resistant seedlings were kan resistant 
indicating a linkage between kan resistance and 
paclobutrazol resistance. In addition, seeds produced 
by crossing spy-4 and spy-1 plants germinated in the 

10 presence of paclobutrazol, the resulting Fl plants 
exhibited a spy phenotype indicating that spy-1 and 
s Py-4 were allelic. 

Segregation of the T-DNA kan resistance 
marker and spy-4 in backcrosses is consistent with the 

15 conclusion that spy-4 plants contain a single 

functional kan resistance locus that is linked to spy- 
4 . 

Although kanamycin resistance segregates as 
expected for a single Mendelian locus, it was still 

20 quite possible that spy-4 contained multiple linked T- 
DNA inserts or additional T-DNA inserts in which the 
KanR gene was either mutated or not expressed due to 
position effects. 

Plasmid rescue was used to isolate plant DNA 

25 flanking the T-DNA insertion site after a genetic 
linkage was demonstrated between the kanamycin 
resistance locus resident in T-DNA and the spy-4 
phenotype. The standard T-DNA plasmid rescue 
procedure involves the identification of restriction 

30 endonucleases that digest the T-DNA and regions 

flanking it such that the products of the reaction 
contain some DNA fragments that contain an intact 
Arabidopsis DNA flanking the T-DNA, bacterial origin 
of replication and a selectable marker for use in 

35 bacteria (Olszewski, N. E., et al. (1988) Nucleic 



Acids Research 16:10765). The products of this 
reaction are then ligated with T4 DNA ligase to 
produce monomer circles. The products of this 
ligation are introduced into E.coli and cells 
receiving the ligation product containing the T-DNA 
sequences and flanking Arabidopsis sequences are 
selected by virtue of their resistance to an 
antibiotic . 

Standard plasmid rescue procedures could not 
be used in this instance because the T-DNA mutant had 
undergone mutations in the T-DNA insertions and the 
piasmids could not be rescued by standard methods 
known in the art. Because of the mutations in the 
inserts positioned near or in the spindly gene, there 
were no predictable protocols for rescuing the gene. 
These problems and the methods for identifying the 
gene are discussed in detail in Example 2. 

Comparison between the sequence of the cDNA 
and the corresponding genomic region indicated that 
the spy mRNA is composed of 18 exons which have the 
potential to encode a 914 amino acid protein (see 
Figures 1 and 2) . Nucleotides 351-579 encode the 
unique N terminus (res. 1-76) and nucleotides 580-1637 
encode the TPR domain (res. 77-430). This region 
contained 10 copies of the TPR repeat. Nucleotides 
1638-3091 encode the unique C-terminus (res. 431-914). 
The region from 1638-3091 did not include the TPR 
region and based on a search in sequence databanks is 
specific for SPY sequences. The SPY amino acid 
sequence was determined from the nucleic acid sequence 
identified from the isolated clone. The amino acid 
sequence was used to search protein databases for 
similar sequences. No close matches were identified. 

As noted above, the SPY gene contains a 
tetratricopeptide repeat (TPR) . In SPY, this repeat 

14 
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was identified as a 34 amino acid repeated sequence 
motif that has been found in a few other proteins 
(Sikorski, R . S., et ai. (1991; Cold Spring Harbor 
Symposia on Quantitative Biology LVI: 663-673 and Lamb, 
5 J . R . , et al. (1995) TIBS 20:257-259). Figure 2 shows 
an alignment of the 10 TPR repeats found in the N- 
terminus of the predicted SPY protein and a consensus 
sequence for the 10 repeats. 

SPY cDNAs from the paclobutrazol-induced 

10 phenotypic mutants, spy-1, 2, 3 (Jacobsen, supra) and 
spy-5, another phenotypic mutant (Wilson, et al.(1995) 
Plant Phys. 108:495-502) were sequenced to confirm 
that the isolated sequence yielded the RNA sequence 
encoding the SPY protein. Each allele contained a 

15 mutation. The spy-3 (G at 2127 is mutated to an A) 
and spy-5 mutations (G at 2884 is mutated to an A) 
caused amino acid substitutions in the C-terminus of 
the protein . 

A shorter PCR product was obtained from 

20 spy-1 and spy-2 RNA than from wild-type RNA. The 

eighth exon was missing in the RNAs from spy-1 and 
spy-2 plants. Reverse transcriptase followed by 
polymerase chain reaction (RT-PCR) was used to amplify 
the region containing the 8th exon to determine which 

25 portions of the exon was missing in the spy-1 and spy- 
2 mutants. Preferred methods for determining the 
sequences are provided in Example 2. Sequencing of the 
genomic DNA from these two lines showed that the spy-1 
mutation affects the 5' exon-intron junction of the 

30 eighth exon, while spy-2 affects its 3' intron-exon 
junction. Both spy-1 and spy-2 plants lacked the 
eighth exon from residues 1410 to 1478 (Figures 1 and 
2) . 

An analysis of the mutants indicated that 
35 the TPR regions are important for SPY protein function. 
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Mutations in the spy-1 and spy-2 plants corresponded to 
a loss of 23 amino acids, including the last 9 amino 
acids of the eighth TPR and the first 14 amino acids of 
the ninth TPR. The spy-1 and spy-2 mutations caused the 
5 loss of the region from Met (354) to Gin (376), This 
indicated that TPR repeats were needed for wild type 
activity. The spy-3 and spy-5 mutations converted 
Gly(593) to Ser and Cys(845) to Tyr respectively. This 
indicated that the unique C-terminal region was also 

10 important. Analysis of the SPY sequence indicates that 
SPY proteins contain a TPR domain and a unique C- 
terminal region . 

Final confirmation that the identified gene 
encoded SPY was provided by the observation that a 

15 cosmid clone containing the SPY gene (clone 2118) 

complemented the paclobut razol germination defect of 
the spy~l mutant (see Example 2). Southern blot 
analysis confirmed that SPY is a single copy gene. 
Using a restriction length fragment polymorphism 

20 present in a SPY containing cosmid clone, the SPY 
locus was mapped to the top of chromosome 3 in 
Arabldopsls thaliana between RFLP markers lAt243 and 
hsp70-9 . 

Based on several gene database searches 
25 (GenBank, EMBL, PIR and Swiss-Prot), it is evident that 
the SPY protein contains a unique N-terminus (res. 1-76 
cDNA nucleotides 351-578, see SEQ ID NO:l), a unique 
TPR domain (res. 77-430, nucleotides 579-1640, see SEQ 
ID NO:l) containing 10 copies of the tetratricopeptide 
30 repeat, and a unique C terminus (res. 431-914, 
nucleotides 1641-3092, SEQ ID NO:l ) 

SPY is only the second TPR gene known in 
plants (Torres, J. H.,et al. E. (1995) Plant Mol . 
Biol. 2:1221-1226). Other TPR-containing genes are 
35 known and there is a growing family of TPR proteins 



which perform diverse functions. Without intending to 
limit the scope or content of this invention in any 
way, among the proposed functions for TPR proteins are 
transcriptional repression, mitochondrial and 
peroxisomal protein transport, cell cycle regulation, 
protein kinase inhibition, and heat shock response 
(Sikorski, et al. supra, and Lamb, et al.(1995), 
supra) . 

There is little in common with the general 
function of these proteins, but they are often found 
in protein complexes and it has been proposed that the 
TPRs form amphipathic alpha-helices that mediate the 
protein-protein interactions. For CYC8 , a yeast 
transcriptional repressor containing 10 TPRS, it was 
shown that the first three TPRs are necessary and 
sufficient for direct interaction with another non-TPR 
protein, TUP1 (Tzamarias and Struhl, 1995 Genes Dev. 
9:821-831). The CYC8-TUP1 complex is thought to be 
recruited by specific DNA binding proteins and acts as 
a transcriptional repressor. Thus, without intending 
to limit the scope or content of this invention, it is 
likely that SPY acts to suppress GA signal 
transduction in part by interacting with other 
proteins through one or more of its TPR domains. 

While there are no obvious sequence motifs 
in the 485 amino acid non-TPR carboxy-terminal region 
that might indicate a specific biochemical function, 
the spy-3 and spy-5 mutations demonstrate that this 
domain is important for normal SPY activity. 
Comparison of this region with protein database 
sequences reveals similarity with a predicted protein 
from C. elegans, K04G7.3 (Wilson, R. et al . (1994) 
Nature 368:32-38)) and less similarity with ESTs from 
the blood fluke Schistosoma mansoni (accession T14591 
GenBank) and human (accession R76782 GenBank) . The 

17 



WO 97/434H9 



PCT/US97/08765 



glycine affected in the spy-3 allele is conserved in 
K04G7.3, while the cysteine affected in spy-5 is not. 
Although the function of K04G7 . 3 is unknown, it also 
contains an N-terminal TPR region which exhibits the 
5 highest level of similarity to the SPY TPR domain. 
This suggests that SPY and K04G7.3 are members of a 
new class of regulatory molecules that is likely to be 
present throughout the eukaryotes. 

The SPY gene of arabidopsis contains a novel 

10 C-terminal domain (cDNA nucleotides 1641-3092 of SEQ 
ID NO:l and in particular amino acids 632-685 and the 
corresponding nucleotide residues 2238 to 2405} that 
is useful for identifying SPY genes in other plants. 
In a particularly preferred method of this invention 

15 probes from the C-terminus are used to screen a cDNA 
library. Example 4 provides a preferred strategy for 
library of maize sequence tags and a maize cDNA 
library (see Example 4). The results of this study 
identified the conserved C-terminus in maize. Figure 

20 5 provides a sequence comparison between a portion of 
the C-terminus of arabidopsis and maize. A 
particularly preferred region of the C-terminus for 
use as a probe is the region between amino acids 609 
to 685 of SEQ ID N0:2 corresponding to nucleic acids 

25 2175-2405 of SEQ ID NO:l and another preferred region 
includes the region extending from 631 to 685 of SEQ 
ID NO:2 that corresponds to nucleic acids 2244-2405 of 
SEQ ID N0:1. Hybridization studies using the C- 
terminus as a probe of southern blots of genomic DNA 

30 from turnip and rice also indicate the presence of a 
single SPY gene. 

Gibberellins are important in corn 
development. There is a correlation between increased 
vigor in hybrid maize and higher gibberellin levels 

35 compared to parental levels, and the greater response 
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of inbreds (compared to hybrids) to exogenously 
applied gibberellin. The gibberellin biosynthetic 
loci appears to be a quantitative trait loci for 
height in maize hybrids. The importance of 
5 gibberellins in plant development is further evidenced 
in the phenotype of gibberellin-def icient mutants of 
maize, which includes: reduced plant stature, due to 
shorter internode lengths; shorter broader leaves; 
less branching of the tassels; and the development of 

10 anthers on the normally spitillate ear resulting in 
perfect flowers (Emerson, et al. (1922) Genetics 
7:203-227). Gibberellins affect the development of 
pistils and stamens in maize by arresting development 
of the stamens in both florets of the ear. This 

15 results in a staminate flower in the proximal floret 

and a mature perfect flower in the distal floret. The 
development of pistils and stamens in the tassel of 
gibberellin deficient mutants is delayed, but 
otherwise is unaffected. 

20 Example 2 discusses the introduction of the 

Arabidopsis SPY gene into a spy phenotypic mutant to 
partially correct the effect of the endogenous mutated 
spy gene. Growth retardants are widely used, 
particularly in greenhouses to control, typically 

25 reduce, plant growth. Since these chemicals act by 

reducing gibberellin levels of the plant it should be 
possible to create plants with increased SPY protein 
levels that do not require chemical treatment. This 
will reduce chemical usage and minimize a potential 

30 environmental hazard. SPY offers an additional 

advantage in that it is likely to act only in the 
tissues/organs where its expression is altered thereby 
offering greater control than can be afforded using 
growth retardants that act systemically . 
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Genes or portions of genes are identified 
using searches of gene databases or by hybridization 
studies as disclosed in Example 4. Related sequences 
can be identified by hybridization at high stringency 
5 (conditions that allow sequences with/ that 90% 

identity to anneal) or lower stringencies (80 or 70% 
identify) Since the TPR repeats are not highly 
identical and TPR repeats of SPY genes are expected to 
be more similar to each other than to TPR regions of 

10 non-SPY genes, hybridizations with this regions can be 
used to identify SPY genes. Sequence homologies in 
gene database searches for genes with TPR homologies 
are considered signicant if the protein homology is at 
least 50%. If the TPR region is employed for database 

15 searching or hybridization studies, the presence of the 
non-TPR regions must be confirmed by hybridization or 
DNA sequencing. Sequence homologies in gene database 
searches for genes with homology to the C-terminus or a 
portion thereof are considered significant if the 

20 protein sequence has a 30% homology. 

The SPY gene from a variety of plants can be 
incorporated into a variety of plant expression vectors 
to produce phenotypic modifications. Methods for 
incorporating the SPY gene into a suitable expression 

25 vector and the introduction of the vector into either 
monocots or dicots are well known in the art. Methods 
for transforming plant cells include microinjection, 
electroporation, Agrobacterium mediated transformation, 
direct gene transfer and particle bombardment such as 

30 using devices available from Agracetus, Inc. Madison 
WI, and the like. Moreover preferred methods for 
introducing exogenous genes into plant cells and the 
production of transgenic plants is well known for a 
variety of plants. PCT Publication No. WO 94/28141 to 

35 Lange, et al . references many publications that 
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disclose methods for transforming a variety of plant 
tissues . 

Plants that have reduced levels of active 
SPY protein are taller than normal plants. Results 
5 provided by data disclosed here suggests that when SPY 
protein levels are increased by introducing extra 
copies of the SPY gene into plants, the plants have 
reduced stature and these methods and the gene can be 
used to modulate plant stature as well as early and 
10 late flower maturity. Development of semi-dwarf 
varieties of many crops (wheat) have been very 
important and have greatly enhanced yield. Treatment 
with gibberellins are used to enhance growth (grapes). 
In some cases it may be possible to create varieties of 
15 grapes with altered SPY protein levels that do not 
require treatment with gibberellins. 

Inhibition of SPY expression can be used to 
mimic the effects of gibberellin overexpression in 
plants. In a preferred method for underexpression, 
20 antisense cloning methods are used. Antisense cloning 
has been demonstrated to be effective in plant systems 
and can be readily adapted by one of ordinary skill in 
the art using the SPY gene using published methods for 
antisense cloning of other genes (Oeller et al., (1991) 
25 Science 254:437-439,). 

In general antisense cloning entails the 
generation of an expression vector encoding an RNA 
complementary (antisense) to the RNA encoding the SPY 
gene. By expressing the antisense RNA in a cell 
30 expressing the sense strand, hybridization between the 
two RNA species will occur resulting in the blocking of 
translation . 

Gibberellins are involved in germination and 
exogenous applications of gibberellins can break seed 
35 dormancy. These observations suggest that manipulation 
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of SPY levels may allow control of the process of 
germination. Overexpression of SPY may prevent 
germination but this inhibition may be overcome by 
treatment with gibberellins . This suggests that iz is 
5 possible to develop strategies that allow propagation 
of elite varieties to be controlled by the producer of 
these varieties. 

Antisense SPY or SPY-containing constructs 
can be used to shorten or lengthen the time to seed 
10 maturity of plants. Increased signal transduction 

shortens the time to flowering. Shortening the time to 
maturity is an advantage in some growth zones whereas 
increased time to maturity is an advantage in other 
growth zones. 

15 The potential to specifically alter the 

growth of an organ or tissue is something that is not 
easily accomplished using conventional breeding or 
chemicals. Therefore, SPY opens avenues for 
improvement of plants that have not been pursued 

20 previously. Mutations in the SPY gene cause 

parthenocarpic (development without fertilization) 
fruit development. Therefore, plants engineered for 
under expression of SPY in flowers may produce seedless 
parthenocarpic fruit . 

25 This invention also relates to tissue 

specific expression or repression of the SPY gene. The 
term tissue-specific expression refers to expression or 
inhibition of the SPY gene in a particular tissue. 
Tissue-preferential and tissue-specific promoters are 

30 used to effect tissue-specific expression meaning that 
the tissue-specific promoter is used to indicate that a 
given regulatory DNA sequence will promote 
transcription of an associated expressible DNA sequence 
entirely in one or more tissues of a plant, or in one 

35 type of tissue, while essentially no transcription of 
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that associated coding DNA sequence will occur in all 
other tissues or types of tissues of the plant. 
Numerous promoters whose expression are known to vary 
in a tissue specific manner are known in the art. Once 
5 example is the maize phosphoenol pyruvate carboxylase 
promoter, which is green tissue-specific. Other green 
tissue-specific promoters include chlorophyll a/b 
binding protein promoters and RubisCo small subunit 
promoters. Pollen-specific promoters are available 
10 from plant calcium-dependent phosphate kinase gene. 

The Example 4 and 5 disclose methods for 
incorporating SPY genes in other plants. Tissue 
specific constructs producing the antisense copy of the 
SPY RNA would reduce SPY protein levels producing more 
15 growth of the tissue, i.e. increased stature, while 

tissue specific constructs producing the SPY mRNA would 
cause an increase in SPY protein and hence less growth 
of the tissues, i.e. reduced stature in those tissues 
where the SPY gene was expressed. The tissue/organ 
20 specificity of the constructs could be controlled using 
promoters with appropriate properties to drive the 
expression of the constructs. 

In addition to controlling growth, 
gibberellins also affect germination of seeds, time of 
25 flowering and response to stress. Therefore, by 
altering SPY levels as described above it may be 
possible to modify these processes as well. Seeds with 
reduced levels of SPY protein exhibit reduced dormancy 
and plants with reduced SPY protein flower earlier. 
30 The present invention is illustrated in 

further detail in the following examples. These 
examples are included for explanatory purposes and 
should not be considered to limit the invention. 

35 
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Exasacpl© 1 
Description o£ the spy- 4 allele 
Although spy-4 had characteristics of other 
spy alleles, in addition, spy-4 exhibited obvious 

5 partial dominance with respect to flowering time. 
Under long day conditions, wild-type plants of the 
ecotype Wassilewski j a (Ws) flowered after producing 
9.4 + /- 0.5 rosette leaves, plants heterozygous for 
spy-4 flowered after producing 5.5 +/- 0.2 rosette 

10 leaves, and plants homozygous for spy-4 flowered after 
producing only 2.0 +/- 0.0 rosette leaves. Plants 
heterozygous for spy-4 did not exhibit partial 
sterility and seeds heterozygous for spy-4 failed to 
germinate in the presence of paclobutrazol , indicating 

15 that these traits are fully recessive. The homozygous 
flowering time phenotype suggests that spy-4 is the 
strongest of the spy alleles identified. 



Example 2 

20 Isolation and Sequencing of SPY gene 

The SPY gene was cloned with the aid of the 
T-DNA insert. The use of the T-DNA system for cloning 
is not predictably successful because the sequence 
incorporates randomly into the genome and there is no 

25 guarantee that any particular gene will contain an 
insert creating a recognizable mutant phenotype. 
Selection of a particular mutant phenotype does not 
guarantee that the mutant relates to a single gene. 
Moreover mutant phenotypes with reduced stature are 

30 rare in T-DNA libraries. 

By hybridizing blots containing spy-4 DNA 
with subcloned portions of the 3850-1003 T-DNA, it was 
determined that the spy-4 plants contained a single 
partial T-DNA. When 28 F3 families each derived from 

35 F2 plants that were either wild-type or heterozygous 



for spy-4 were analyzed for the presence of pBR322 
sequences, the pBR322 sequences cosegrega ted perfectly 
with spy-4. These results indicated that the T-DNA 
insert was responsible for the spy-4 allele and 
demonstrated that the T-DNA sequence was useful as a 
probe to clone SPY. 

Because of alterations to the 3850:1003 T- 
DNA of the spy-4 genes, it was necessary to modify the 
standard cloning methods (Feldman, K.A. (1992) T-DNA 
insertion mutagenesis in Arabidopsis: Seed 
infection/ transformation . In: Methods in Arabidopsis 
Research, C. Koncz, et al. World Scientific Publishing 
CO. Singapore pp. 274-289). A restriction map of the 
region containing the T-DNA and the surrounding DNA 
was produced. As shown in the map provided in Figure 
6A, T-DNA typically contains two pBR322 regions, a 
left border and a right border and working restriction 
endonuclease recognition sites distributed throughout 
the T-DNA. Analysis of the restriction map indicated 
that the ends of the T-DNA were deleted (Figure 6B) . 
The regions deleted included the left and right border 
sequences and a portion of each pBR322 sequence. In 
addition, non-T-DNA sequences from Agrobac cerium 
tumefaciens, identified by DNA sequencing, were 
located adjacent to the right border side of the T- 
DNA. Detailed restriction mapping of the remaining 
portions of the two copies of pBR322 suggested that 
the copy normally adjacent to the right border might 
be functional but that the partial copy normally 
adjacent to the left border, which was located 
adjacent to arabidopsis sequences was not functional. 
Moreover, the ampicillin and kanamycin resistance 
markers appeared to be intact. The T-DNA sequences 
was analyzed and restriction enzymes that did not have 
recognition sites in the T-DNA were identified by 
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trial and error. The enzymes tested included Bell, 
Bglll, Kpnl, NotI, Sad, Spel and Xbal . In testing 
each of these enzymes in plasmid rescue, two enzymes 
appeared to work. These were Bell and Sad. The SacI 
5 derived plasmid was selected because it was larger 
(Figure 6C) . 

Briefly, spy-4 genomic DNA was digested with 
each of these enzymes individually, the DNA in each 
reaction was then diluted to a concentration that 
10 favored circularization , rather than 

concatermerization, of the digested products and self- 
ligated using T4 DNA ligase. The resulting ligation 
product was introduced into E.coli (MC1061) by 
electropora t ion . Transf ormants were selected on 
15 medium containing kanamycin (2 5|ig/ml) and ampicillin 
(75 ng/ml) . Samples from the Bell and SacI digestion 
yielded transf ormants . Restriction maps of plasmids 
isolated from these transf ormants were then generated 
and compared to the map of the genomic region to 
20 identify clones containing the T-DNA and flanking 

arabidopsis sequences. Plasmids from both the SacI 
and Bell rescue experiments had restriction maps 
consistent with them containing arabidopsis DNA 
sequences flanking the T-DNA causing the mutant 
25 phenotype. The Sacl-derived plasmid was then used as 
a probe to obtain the wild-type SPY gene. 

A 5.8 kb Sail to SacI fragment containing 
DNA and pBR322 that was derived from the Sacl-rescued 
plasmid was subcloned into pBluescript II KS 
30 (Stratagene, La Jolla, CA) . The resulting plasmid was 
digested with PvuII. This effectively separated the 
pBR322 sequences from the arabidopsis sequences but 
generated a restriction fragment in which the 
arabidopsis sequences were linked to the promoter for 
35 T7 RNA polymerase which is located directly adjacent to 
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the cloning site in pBluescript II KS . The digestion 
products were transcribed by T7 RNA polymerase in the 
presence of 32 P-UTP to generate a hybridization probe 
that was specific for arabidopsis sequences flanking 
5 the T-DNA insertion. This probe was purified and used 
to probe an arabidopsis genomic library constructed in 
the binary plant transformation vector 

POCA18 (Olszewski, (1988), supra). 30,000 colonies were 
grown and transferred to hybridization membranes and 

10 hybridized with the labeled RNA probe using standard 
methods for hybridization (Sambrook, et al . (1989) 
Molecular Cloning: A laboratory manual. Cold Spring 
Harbor Press). This screening identified six colonies 
containing plasmids that hybridized to the probe. The 

15 colonies were grown and the cosmids were isolated. 

Further analysis of two of these cosmids, 2118 and 4273 
confirmed that they contained sequences that hybridized 
to the original probe and they were used in all 
subsequent experiments . 

20 The EcoRI region from cosmid 4273 was 

labeled and used as a probe to probe a wild-type 
arabidopsis cDNA library. Clones were selected, 
sequences and compared to the plasmid rescue. A clone 
was selected that was closest to the T-DNA insert 

25 site. This clone contained a 3.5kb fragment. The 5 r - 
end of one 3.5 kb cDNA clone was found to be 13 bp 
downstream of the T-DNA insertion, and this cDNA was 
presumed to encode the SPY protein. Northern blot 
analysis indicated that RNA from wild-type and the 

30 four EMS alleles contain roughly similar amounts of a 
3.5 kb transcript, which hybridized to this cDNA, but 
RNAs hybridizing to this cDNA were not detectable in 
spy-4 (not shown) . This correlates with the 
observation that spy-4 is the strongest spy allele, 
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and suggests that spy-4 may be an RNA null allele and 
that the 3.5 kb cDNA was a cDNA of SPY rnRNA. 

The 3.5 kb SPY cDNA was subcloned from 
lambda GT22A using polymerase chain reaction (PGR) 

5 followed by TA cloning (Invitrogen, San Diego, CA, 

92121). Three independent clones were sequenced using 
the Sequenase Version 2.0 DNA sequencing Kit (United 
States Biochemical Corporation; Cleveland, OH 44122) . 
This sequence is designated SEQ ID NO;l. The protein 

10 encoded by this gene is designated SEQ ID NO:2. To 
sequence the spy alleles, reverse transcriptase PCR 
was performed on total RNA from the various alleles. 
The primers used for RT-PCR to generate cDNA were the 
same as those used to sequence the mutant alleles. 

15 Table 1 below provides the primers used in these 
experiments . 



Name 


SEQ ID NO: 


Sequence ( 5 ' -3 ' ) 


JP67 


4 


CTTCTCTTATGTCTACTCAA 


JP48 


5 


TGTTGAAGCTCACATTGGAA 


JP66 


6 


CACAGCTTTATCAAGGTT 


JP50 


7 


GGCCATAGCTCTGACA 


JP72 


8 


GAGAAATGCTAAAGTTCGACAT 


JP64 


9 


TACAGAGATGCTGGAA 


JP62 


10 


GCAGCAGAAAGGTTTGCAT 


JP63 


11 


ATCCAAATACTACGGGTT 


JP71 


12 


CTTTCACACCTTTTCCAGAT 


JP58 


13 


GAGATCCAGCCATTAGAT 



30 

RT-PCR products were amplified from a region 
of the SPY RNA containing the eighth exon for wild 
type and the spy mutant alleles. By size, the RT-PCR 
product was smaller in spy-1 and spy-2 than in the 
35 wild type. RT-PCR products of the two alleles spy-1 
and spy-2 had lost the 8th exon indicating that they 
had been misspliced. RNA from the wild type ecotype 
Ws or from the spy mutants was subjected to RT-PCR. 
Products were digested with HindIII f subjected to 
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agarose gel electrophoresis, stained with ethidium 
bromide and visualized using UV light. A shorter PCR 
product was obtained from spy-1 and spy-2 RNA than 
from wild-type RNA. The sizes of the products are 
5 consistent with that expected in both the wild type 
(416 bp) and the splicing mutants (347 bp) . 

To sequence wild-type genomic DNA, an 8 
kilobase Xbal fragment was subcloned from cosmid 2118 
into pBluescript KS + (Stratagene, La Jolla, CA) , and 

10 all intron and exon sequences were determined (6467 

base pairs) . To sequence the intron/exon junctions in 
the spy-1 and spy-2 mutants, genomic DNA was subjected 
to PCR using two primers JP81 (SEQ ID NO: 14) with 
sequence GCTTTCCACTTTAATCCACAT and JP82 (SEQ ID NO: 15) 

15 with sequence GAAGATGAGAAAACAGACCT . 

and the products were sequenced directly. 

Successful cloning of the SPY gene was 
confirmed because northern blots of mutant message as 
compared to wild type message hybridized with wild type 

20 cDNA probes indicated that SPY expression was lacking 

in the spy-4 mutant. The mutant alleles were sequenced 
to demonstrate the presence of mutations in each 
allele. The SPY gene was transformed using methods 
described in Valvekens, D., et al.(1988) PNAS (USA) 

25 85:5536. into the spy-1 mutant and the transgenic spy-1 
mutant containing the SPY gene had a partial reversion 
in its mutant phenotype. The SPY gene contains 17 
introns and encodes a 3.5 kb transcript. The deduced 
protein contains 914 amino acids and contains 10 TPR to 

30 demonstrate that the isolated gene produced the 
observed phenotypic trait. 
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Example 3 
Characters zati n of the SPY protein 

The SPY cDNA (see SEQ ID N0:1) is cloned 

5 into an expression vector such as pMALC {New England 

Biolabs) and expressed in E.coli. The protein is 

purified using methods known in the art such as 

affinity chromatography on a maltose containing column 

and used for biochemical studies designed to test the 

10 function of the SPY protein. The antisera is prepared 
to peptides or purified spy protein. Western blotting 
and tissue printing (Cassab and Varner (1987) J. Call 
Biol 105:2581-2588, 1987) are used to assess the size, 
abundance and distribution of the spy protein in the 

15 different plant organs. Immunolocaliza tion studies 
are performed to localize SPY within the cell 
(Varagona, et al . 1991 The Plant Cell 3:105-113). 
These methods are also used to detect changes in 
protein expression during the transition from 

20 vegetative growth to mature plant. 

During the course of the current study, it 
was discovered that the previously described spy-1 
line (Jacobsen, supra) most likely also carries a 
linked mutation at the HY2 locus. This linkage was 

25 identified because the HY2 locus maps in the same 

approximate region as SPY. Similar to spy mutants, 
hy2 mutants are early flowering and have pale green 
foliage. However, hy2 mutants do not exhibit male 
sterility and do not germinate in the presence of 

30 paclobutrazol (Jacobsen, et al. supra). Genetic 

complementation tests indicated that spy-1 fails to 
complement hy2-l, while spy-4 and spy-5 fully 
complement hy2-l. Sequencing of the SPY cDNA from two 
hy2 alleles detected no mutations in the coding 

35 region, and RNA blot analysis indicated that there was 



30 



no change in SPY RNA size or abundance in these lines. 
Finally, the spy-1 allele exhibits a long hypocotyl 
phenotype, characteristic of hy2 mutants, while the, 
spy-4 and spy-5 alleles do not; and spy- J has a more 
severe early flowering phenotype than spy-2 even 
though the molecular lesions in these two mutants are 
very similar. Taken together and without intending to 
limit the scope of this invention, these results 
suggest that the spy-1 line carries mutations in both 
the SPY and HY2 genes, and that SPY and HY2 are 
separate but linked genes affecting partially 
overlapping aspects of growth and development. 
Because our original genetic analysis was performed 
using the spy-i line, we sought to confirm these 
results with the stronger spy-4 allele. 

Double mutants were constructed between 
spy-4 and mutants affected in either GA biosynthesis 
or GA response. Mutations at the GA1 locus, the 
gibberellin biosynthesis enzyme e^t-kaurene synthetase 
A (Sun, T., et al.(1994) Plant Cell 6:1509-1518.), 
block GA biosynthesis early in the synthesis pathway 
(Barendse, G.W.M., et al.(1986). Physiol. Plant. 
67:315-319 and Zeevaart, J.A.D. et al.{1992). In 
Progress in Plant Growth Regulation, eds . Karssen, 
CM., van Loon, L.C., & Vreugdenhil, D. (Dordrecht: 
Kluwer Academic Publishers), pp. 34-42*). The 
phenotypes of the strong gal-2 mutant include 
dwarfism, failure to germinate, male sterility, and 
incomplete petal development. These phenotypes were 
reversed by applied gibberellins (Koornneef M . , et al . 
(1980). Theor. Appl . Genet. 58, 257-263). 

As with the weaker spy-1 allele, spy-4 is 
largely but not completely epistatic to gal-2, spy-4, 
gal-2 double mutants do not require exogenous GA for 
seed germination, petal development, and male 
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fertility; but spy-4 gal-2 double mutant plant height 
is less than that of spy-4 single mutants. We have 
also found that, similar to the spy-1 gal-2 mutant, 
the spy-4 gal-2 double mutant still responds to 
5 exogenous GA treatment with an increase in plant 

height (S. Jacobsen, unpublished observation) . These 
results suggests that spy-4 plants activate a basal 
level of GA independent signal transduction, but that 
they still respond to changes in the levels of active 

10 GAs in the plant. 

To test the relationship between spy mutants 
and the semi dominant GA insensitive (gai) mutant 
(Koornneef, M . , et al.(1985). Physiol. Plant. 65:33- 
39), we constructed the spy-4, gai double mutant. 

15 Whereas a weak allele, spy-5, was only partially 

epistatic to gai spy-4 is completely epistatic to gai. 
This unambiguous epistasis allows these two genes to 
be ordered and suggests that GAI acts upstream of SPY. 

In summary, the results from these double 

20 mutant analysis together with the spy phenotype 
suggest that the wild type SPY product acts as a 
negative regulator of a portion of the GA signal 
transduction pathway that is common to all GA 
responses and that is downstream of both GA 

25 biosynthesis and the step affected in the gai mutant. 

Example 4 

Method and Identification of SPY gene in other plants 

There are at least two methods for 
identifying the SPY gene in other plants and for 

30 obtaining this gene as a clone for future transgenic 

studies. In the first method the SPY protein sequence 
was used to search sequence databases such as GenBank, 
EMBL and PIR to identify sequences capable of encoding 
similar proteins- Although searches of the public 

35 databases did not identify any plant sequences, a 
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search of a cDNA database at Pioneer Hi-Bred identified 
a partial cDNA sequence encoding a protein with 90% 
identity to a portion of the non-TPR region of the SPY 
protein. Figure 5 provides the alignment of the 
5 identified sequence and the region of homology between 
the SPY gene and a gene in maize. 

The second approach involves using the SPY 
cDNA as a hybridization probe to identify clones that 
hybridize to the probe under conditions of low (at 

10 least 70% identity), medium( at least 80%) identity or 
high (at least 90% identity) stringency. Hybridization 
of a labeled Clal fragment spanning the region from 
nucleotide 1601 to 3382 of the SPY cDNA to genomic 
blots containing DNA from turnip, maize and rice that 

15 had been digested with various restriction enzymes 

under conditions of low stringency detected hybridizing 
fragments in all of these species. Once a clone is 
identified, standard molecular techniques are used to 
identify the entire sequence of the gene. 

20 A low stringency hybridization is considered 

for these studies as a prehybridization in 0 . 5M NaPO^ 
(pH 7.0), 7% SDS, 1% BSA at 55°C for 2 hours, 
hybridization in the same solution at 55°C for 45 
hours, two washes for 15 min with 1 x SSC, 0.1% SC, 

25 0,1% SDS at room temperature followed by 2 washes for 

15 min with 1 x SSC, 0.1% SDS at 37°C, or its effective 
equivalent. A medium stringency hybridization is 
considered in this disclosure to be a prehybridization 
in 0.5M NaPO< (pH 7.0), 7% SDS, 1% BSA at 55°C for 2 

30 hours, hybridization with probe in the same solution at 
55°C for 45 hours, and 2 washes for 15 min each, with 5 
x SSC, 0.5% SDS at room temp., 2 washes 15 min. each 
with 1 x SSC inn 0.5-1.0% SDS at 37°C and one wash for 
15 min with 0.1 x SSC, 1.0% SDS at 37°C. A high 

35 stringency hybridization is considered to include a 



prehybridization in 0.5M NaP0 4 (pH 7.0) , 7% SDS, 1% BSA 
at 65°C for 2 hours, hybridization in the same solution 
at 65°C for 45 hours and 2 washes for 15 rnin each in 5 
x SSC, 0.5% SDS at room temperature, 2 washes at 15 
min. each with 1 X SSC, 0.5-1.0% SDS at 37°C and 3 
washes for 15 min each with 0.1 x SSC, 1.0% SDS at 65 
°C. Those skilled in the art will recognize that other 
salts and detergents could be used in other 
combinations to produce other formulations for low, 
medium and high stringency hybridization and wash 
conditions and the exact formulas provided here are not 
intended to detract from the spirit of the distinction 
between the various stringency hybridization 
conditions . 

Once the genes are identified they are 
introduced into suitable vectors with the appropriate 
enhancers, promoters and the like to direct facilitate 
incorporation of the gene into the plant and to permit 
expression of the exogenously-derived gene in the plant 
cells and tissues. 

For dicots the genes are preferably 
incorporated into Agrobacterium vectors under the 
direction of a suitable promoter and transgenic plants 
are created using techniques known in the art 
including, but not limited to the use of binary plant 
transformation vector pOCA 18 (Olszewski, et al. (1988) 
supra) using the methods of that reference or those of 
Czako, M et ai . (1986) Plant Mol. Biol. 6:101-109 and 
Jones, et al.(1985) EMBO J. 4:2411-2418). These clones 
are mobilized into A. tumefaciens strain AGL1 (Lazo, 
et al. 1991) by tri-parental mating (Olszewski, et al. 
1988, supra) and the resulting A . tumefaciens strains 
are used to generate transgenic plants using the 
methods of Valvekens, et al . (1988, supra). 
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For monocots the genes are incorporated into 
a suitable vector under the control of a suitable 
promoter, known to those skilled in the art of monocot 
transformation. The vector is incorporated into the 
monocot using techniques known in the art, including, 
but not limited to those disclosed in PCT Publication 
number WO 91/02071 to Adams, et al., those of European 
Patent Application Publication No. 586 355 A2 , or Wan, 
et al. (1994) Plant Physiol 104:37-48* Those skilled 
in the art will recognize that there are a variety of 
methods known to create transgenic dicots and monocots 
and that these methods do not detract from the scope or 
content of this invention. 

Alternatively it is possible to search 
nucleic acid and protein databases directly such as 
PIR-Protein, SWISS-PROT, GenBank and EMBL database with 
the deduced SPY protein from residues 431-914 for 
similar protein sequences or nucleic acid sequences 
capable of encoding a protein containing a similar 
region (>30% identity). Types of sequences searched 
should include cDNA, genomic protein and expressed 
sequence tags (ESTs). Once a positive identification 
is made, the identified sequence is assessed for the 
presence or absence of TPRs . No plant genes were 
identified in these databases to date with the 
characteristics of the genes of this invention, 
however, as further information is added to the 
databases, these searches may prove useful to the 
identification of other SPY genes. 

Using a Bglll restriction length fragment 
polymorphism present in cosmid 4111, the SPY locus was 
mapped on the Landsberg erecta X Columbia mapping 
lines in the laboratory of Dr. Elliot Meyerowitz as 
described in (Chang, C, et al.{1988) Proc. Natl. 
Acad. Sci. U.S.A. 85:6856). 

35 



WO 97/43419 PCT/US97/08765 



Example 5 
Overexpression contructs of SPY 

Overexpression of the SPY protein or a 

5 portion of the SPY protein, such as the fragments 

disclosed throughout this document, can be achieved by 

either overexpressing the transcribed region of the SPY 

gene or by oberexpressing the protein coding region of 

the SPY cDNA. Overexpression employing the gene 

10 sequences can be accomplished by replacing the SPY 
promoter with a stronger promoter such as the 35S 
promoter of Cauliflower mosaic virus (CaMV) and 
introducing the resulting gene into plants. Briefly, 
the transcribed region can be cloned into a plant 

15 transformation vector such as pBI121 (Clonetech 

Laboratories, Palo Alto, CA) in place of the beta- 
glucuronidase coding region in the sense orientation 
relative to the 35S promoter. Since pBI121 is a binary 
transformation vector, this construct can be introduced 

20 into Agrobacterium. The resulting strain can then be 
used to transform dicot plants by a protocol 
appropriate to the species being transformed. These 
methods are well-known and the literature provides 
preferred transformation strategies for a wide variety 

25 of plant species. 

For overexpression using the cDNA, the cDNA 
is cloned in the sense orientation in place of the 
beta-glucuronidase coding region of pBI121. In this 
construct, the SPY coding region is flanked by the 35S 

30 promoter, which drives expression in plants, and the 
downstream NOS terminator, which terminates 
transcription. Given the large number of introns in 
the SPY gene, which must be removed during the 
formation of the mature mRNA, overexpression of the SPY 
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cDNA which lacks the introns is the preferred method of 
overexpression . 

Antisense constructs will be produced by 
replacing the beta-glucuronidase coding region of 
5 pBI121 with the SPY cDNA in an inverted orientation 

relative to the CaMV 35S promoter. This will allow the 
CaMV 35S promoter to drive the expression of antisense 
SPY RNA and the NOS terminator to terminate the 
production of this transcript. 

10 References cited in this disclosure are 

incorporated herein by reference to the extent that 
they supplement, explain, provide a background for or 
teach methodology, techniques or compositions employed 
herein. Although the invention has been described in 

15 the context of particular embodiments, it is intended 
that the scope of coverage of the patent not be 
limited to those particular embodiments, but be 
determine by reference to the submitted claims. 
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What is Claimed is: 

1 . An isolated nucleic acid fragment comprising at 
least nucleic acids 2175-2405 of SEQ ID NO:l. 

2 . A vector comprising a promoter and a nucleic acid 
fragment, comprising at least nucleic acids 2175-2405 
of SEQ ID NO:l. 

3. Transgenic plants expressing protein encoded by 
an exogenously-deri ved nucleic acid fragment 
comprising at least nucleic acids 2175-2405 of SEQ ID 
NO: 1 . 

4. A cell containing the vector of Claim 2. 

5. The vector of Claim 2, wherein the promoter is 
plant tissue specific. 

6. The vector of Claim 2, wherein the promoter is 
specific for fruit expression. 

7. The vector of Claim 2, wherein the promoter is 
specific for green tissue expression. 

8. A transgenic plant comprising an exogenously- 
derived nucleic acid fragment of at least nucleic 
acids 2175-2405 of SEQ ID NO:l. 

9. A transgenic plant expressing a polypeptide 
encoded by an exogenously-derived nucleic acid 
fragment comprising at least amino acids 609-685 of 
SEQ ID N0:2. 

10. A transgenic plant expressing a polypeptide, 
where the polypeptide is encoded by an exogenously- 
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derived nucleic acid fragment capable of hybridizing 
under highly stringent hybridization conditions to an 
isolated nucleic acid fragment comprising at least 
nucleic acids 2175-2405 of SEQ ID N0:1. 

11. An isolated nucleic acid fragment comprising 
nucleic acids 579-1640 of SEQ ID N0:1. 

12. A vector comprising a promoter and a nucleic acid 
fragment, comprising at least nucleic acids 579-1640 
of SEQ ID NO:l. 

13. Transgenic plants expressing protein encoded by 
an exogenously-der i ved nucleic acid fragment 
comprising at least nucleic acids 579-1640 of SEQ ID 
NO: 1 . 

14. A cell containing the vector of Claim 12. 

15. An isolated nucleic acid fragment encoding the 
polypeptide of SEQ ID NO:2. 

16. A method for identifying a gene from a plant 
comprising the steps of: 

identifying at least a first clone from a 
cDNA library from a plant that hybridizes under 
low stringency hybridization conditions to a 
probe comprising nucleic acids 1641-3092 of SEQ 
ID NO : 1 , wherein the clone contains at least a 
portion of an open reading frame encoding a 
polypeptide with a C-terminus having at least 50 
% amino acid homology to amino acids 609-685 of 
SEQ ID NO 2; 

obtaining a full-length clone, wherein the 
full-length clone has about 100% nucleic acid 
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homology to the open reading frame of said first 
clone; and 

determining if the full length clone 
contains at least one tetratricopeptide repeat 
region . 

17. The method of Claim 16, wherein the determining 
step additionally comprises determining if the 
tetratricopeptide repeat region of the full-length 
clone hybridizes under stringent hybridization 
conditions to a nucleic acid fragment consisting of 
nucleotides 579-1640 of SEQ ID NO:l. 

18. An isolated recombinant gene selected by the 
method of: 

identifying at least a first clone from a 
cDNA library from a plant that hybridizes under 
low stringency hybridization conditions to a 
probe comprising nucleic acids 1641-3092 of SEQ 
ID N0:1, wherein the clone contains at least a 
portion of an open reading frame encoding a 
polypeptide with a C-terminus having at least 30 
% amino acid homology to amino acids 609-685 of 
SEQ ID NO 2; 

obtaining a full-length clone, wherein the 
full-length clone has about 100% nucleic acid 
homology to the open reading frame of said first 
clone; and 

determining if the full length clone 
contains at least one tetratricopeptide repeat 
region . 
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19. The gene of Claim 18, wherein the open reading 
frame of the gene has at least a 70% homology to SEQ 
ID NO:l. 

20. The gene of Claim 18, wherein the open reading 
frame of the gene has a tetratricopeptide repeat 
regions that that hybridizes under low stringency 
hybridization conditions to a nucleic acid fragment 
consisting of 579-1640 of SEQ ID NO:l and has at least 
50% homology to amino acids 77-430 of SEQ ID NO:2. 

21. The gene of Claim 19, wherein the open reading 
frame of the gene hybridizes under stringent 
hybridization conditions to a nucleic acid fragment 
consisting of nucleic acids 2175-2405 of SEQ ID NO:l. 

22. The gene of Claim 19, wherein introduction of a 
vector capable of directing expression of the gene 
into a wild-type plant results in a spindly phenotype. 

23. A method for identify a gene from a plant 
comprising the steps of: 

searching a gene database for a nucleic acid 
sequence encoding a polypeptide from a plant and 
having at least 30% amino acid homology to the 
amino acid fragment corresponding to amino acids 
609-685 of SEQ ID N0:2; and 

determining whether the open reading frame 
includes a tetratricopeptide repeat region. 

24. The method of Claim 23, wherein after the 
searching step the method additionally comprising the 
steps of : 
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using the nucleic acid sequence identified 
from the database as a probe to obtain a cDNA 
clone containing an open reading frame that 
includes the sequence identified from the gene 
database; and 

sequencing the cDNA clone. 

25. An isolated gene from a plant encoding a 
polypeptide where a portion of the polypeptide has a 
30% amino acid homology to amino acids 609-685 of SEQ 
ID NO: 2, wherein the polypeptide encoded by the gene 
includes a tetratricopept ide repeat region and wherein 
introduction of a vector directing expression of the 
gene into a plant produces a transgenic plant with a 
spindly phenotype . 

26. A transgenic plant comprising an exogenously 
derived nucleic acid fragment comprising SEQ ID NO:l. 

27. An isolated nucleic acid fragment comprising at 
least nucleotides 1641-3092 of SEQ ID NO : 1 . 
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FIGURE 2 



Sequence of the SPV Protein 



MVGLEDDTERERS PWENGFSNGSRS S S S S AGVLS P SRKVTQ GKDTL S YA 50 
NILRARNKFADAIxALYEAMLEKDSKN 76 

VEAHIGKGICLQTQNKGNLAFDCFSEAIRLDPHN 110 

ACALTHCGILHKEEGRLVEAAESYQKAL1«IADASYKPAAECL 151 

AIVLTDLGTSLKLAGNTQEGIQKYYEALKIDPHY 185 

APAYYNL.G WY S EMMQYDNALS C YEKAALERPMY 219 

AEAYCNMGVIYKNRGDLEMAITCYERCLAVSPNFEIAI<^INM 260 

AI ALTDLGTKVKLEGDVTQGVAYYKKALYYNWHY 294 

ADAMYNLGVAYGEMLKFDMAI VFYELAFHFNPHC 328 

AEACNNLGVLi YKDRDNLDKAVEC YQ I4AL S I K PNF 362 

AQSLMNLGVVYTVO GKMDAAASMIEKATLANPTV 395 

AEAFNNLGVLYRDAGNITMAIDAYEECLKIDPDS 430 

A A NLGV YK G A YE AL P Y Consensus 

RNAGQ^mLLAMNYIOT:GLDDKLFEAHRDWGWRFTRI J HPQYTSWDNLKDPE 4 8 0 

RPITIGYISPDFFTHSVSYFIEAPLTHHDYTKYKWVYSAWI-CADAKTYR 53 0 

FRDKVLKKGGVWKDI YGIDEKKIASMVREDKIDILVELTGHTANNKLGTM 580 

ACRPAPVQVTTa'IGYPNTTGLPTVDYRITDSLADPPDTKQKQVEELVRLPD 63 0 

C FLC YT PS PEAG PVC PTPALSNGFVTFG S FNNLAK I T PKVLQVWAR I LC£ 680 

VPNSRLWKCKPFCCDS IRQRFLTTLEQLGLESKRVDLLPLILFNHDHMQ 730 

AYSU^ISLDTFPYAGTTTTCESLYMGVPCVTMAGSVHAHNVGVSLLTKT/ 780 

GI/3HLVAKNEDEYVQLSVDLASDVTALSI<LRMSLRDLMAGSPVCNGPSFA 830 

VGLESAYRNl^TKKYgKGEVPSLRRMEMLQKEVHDDPLISICDLGPSRVSVT 880 

GEATPSLKANGSAPVPSSLPTQSPQLSKRMDSTS 914 
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FIGURE 3 A 
Sequence of the SPV cDNH 



1 


GGAGAGCTAA 


TCTGAGTCGT 


TGC CTCTGTC 


CACTGGCTCG 


GACCGACCTC 


51 


GTACCTCTTT 


CATTTGTCCT 


C G AG CTTATA 


ATAGAGGCTA 


CGCGTCGCCG 


101 


CCACCTCCGC 


TCCATCCATT 


CACGGCCAAG 


CGACGACTCC 


ACCGCTTAGG 


151 


CTTGGCGTCT 


GAGGTATACT 


GATCAACGCT 


TTCTGTTTTC 


GGAGGAGCGA 


201 


GGCGAGATCA 


GCTTCTCTTA 


TGTCTACTCA 


AACATATCAT 


TCTTCTTTAG 


251 


CCACTCGATT 


TCTTCTCTAG 


CGGCTCCAGA 


GGTTTCGTCG 


CTACAGAGTC 


301 


AAGTTCCTCT 


TTTCAGGTTT 


TGTGGTGAAC 


AAGATTTTAG 


TTACAAAAAA 


351 


ATGGTGGGAC 


TGGAAGATGA 


TACTGAGAGA 


GAGAGGTCAC 


CAGTCGTAGA 


401 


GAATGGTTTT 


TCCAATGGGT 


CTCGGTCTTC 


TTCTTCTAGC 


GCAGGTGTTT 


451 


TGTCTCCATC 


ACGAAAGGTC 


ACTCAGGGGA 


ACGATACACT 


TTCTTATGCC 


501 


AATATTCTTC 


GGGCAAGAAA 


CAAGTTTGCT 


GATGCGCTTG 


CTCTCTATGA 


551 


GGCTATGCTG 


GAGAAAGATA 


GCAAGAATGT 


TGAAGCTCAC 


ATTGGAAAAG 


601 


GGATATGCTT 


GCAGACGCAG 


AACAAAGGGA 


ATCTAGCTTT 


CGATTGTTTT 


651 


TCTGAAGCGA 


TCAGGTTGGA 


TCCGCATAAT 


GCTTGTGCCC 


TTACACACTG 


701 


TGGTATACTT 


CATAAAGAAG 


AAGGACGCCT 


CGTAGAAGCT 


GCTGAGTCCT 


751 


ACCAGAAAGC 


ATTC-ATGGCA 


GATG CATC AT 


ACAAGCCAGC 


AGCAGAGTGT 


801 


TTAGCCATTG 


TTTTGACCGA 


CCTTGGAACT 


AGCCTGAAGC 


TGGCTGGGAA 


851 


TACTCAGGAA 


GGAATTCAAA 


AGTATTACGA 


AGCCCTTAAG 


ATTGACCCAC 


901 


ACTATGCTCC 


TGCATATTAC 


AACTTAGGTG 


TTGTATACTC 


CGAAATGATG 


951 


CAATATGACA 


ATGCCTTGAG 


CTGCTACGAG 


AAGGCTGCAC 


TTGAG^GGCC 


1001 


TATGTATGCT 


GAAGCATATT 


GTAACATGGG 


TGTCATTTAT 


AAG AAC CG TG 


1051 


GTGACTTGGA 


GATGGCAATC 


ACTTGTTATG 


AGAGATGTCT 


AGCTGTGTCT 


1101 


CCAAACTTTG 


AGATTGCGAA 


GAACAATATG 


GCCATAGCTC 


TGACAGATTT 


1151 


AGGAACAAAG 


GTTAAACTTG 


AAGGCGATGT 


AACCCAAGGA 


GTGGCATATT 


1201 


ACAAGAAGGC 


TCTCTATTAT 


AACTGGCACT 


ATGCAGATGC 


TATGTACAAT 


1251 


CTTGGGGTGG 


CTTATGGAGA 


AATGCTAAAG 


TTCGACATGG 


CGATTGTCTT 



WO 97/43419 PCT/US97/0876S 

4/10 

FIGURE 3B 



1301 


CTATGAGCTT 


GCTTTCCACT 


TTAATCCA.CA 


TTGTGCTGAG 


GCTTGCAACA 


1351 


ATTTGGGAGT 


ACTTTACAAA 


GACCGTGACA 


ACCTTGATAA 


AGCTGTGGAG 


1401 


TGTTATCAGA 


TGGCTCTATC 


AATCAAACPA 


AATTTTGCAC 


AGTCGCTTAA 


1451 


TAACCTTHGT 


GTCGTCTATA 


CAGTCCAGGG 


GAAAATGG.-.T 


GCTGCTGCCA 
AGAAGCTTTT 


1501 


GCATGATTGA 


GAAGGCTATA 


A (spy-1) 
CTTGCTAATC CCACATATGC 


1551 


AACAACCTAG 


GTGTTCTTTA 


CAGAGATGCT 


GGAAATATAA 


CTATGGCTAT 


1601 


CGATGCTTAT 


GAGGAATGCC 


TTAAGATAGA 


TCCAGATTCT 


CGCAATGCTG 


1651 


GCCAGAACCG 


ATTGCTTGCC 


ATGAATTACA 


TAAATGAAGG 


ACTCGATGAC 


1701 


AAACTATTTG 


AGGCTCACAG 


AGACTGGGGT 


TGGCGCTTCA 


CAAGATTACA 


1751 


CCCTCAATAC 


ACTTCATGGG 


ATAATCTGAA 


AGATCCAGAG 


CGACCTATCA 


1801 


CCATTGGATA 


TATCTCACCA 


GATTTCTTCA 


CTCATTCAGT 


ATCTTATTTC 


1851 


ATTGAAGCTC 


CCCTTACGCA 


TCATGATTAT 


ACAAAGTACA 


AAGTGGTGGT 


1901 


TTATTC AG CG 


GTAGTTAAGG 


CAGATGCAAA 


AAC ATAC AG 3 


TTTAGGGATA 


1951 


AAGTGTTGAA 


GAAAGGTGGA 


GTTTGGAAGG 


ATATATACGG 


GATAGATGAG 


2001 


AAAAAGATAG 


CAAGTATGGT 


CAGAGAGGAC 


AAAATCGACA 


TTTTGGTAGA 


2051 


ACTCACTGGT 


CA7ACGGCAA 


ACAACAAGTT 


GGGAACAA7G 


GCCTGCAGAC 


2101 
2151 


CAGCACCGGT 
CCCACTGTTG 


TCAGGTTACT 
ATTACAGAAT 


TGGATCG3CT ATCCAAATAC 

A {spy- 3) 
TACAGATTCG TTGGCTGATC 


7ACGGGTTTG 
C ^CC a G ATAC 


2201 


CAAACAAAAG 


CAGGTCGAGG 


AGCTGGTTAG 


GCTTCCGGAC 


- Vj 1 1 i 1L 1 1 1 


2251 


GTTATACACC 


TTCCCCAGAG 


GCTGGTCCTG 


TTTGTCCAAC 




2301 


*. \— inn X uVJL ± 




TGGTAGTTTC 


AACAACCTCG 


CAAAGATAAC 


2351 


TCCTAAGGTG 


CTGCAAGTGT 


GGGCTAGGAT 


ACTGTGTG C A 


GTTCCCAATT 


2401 


CTCGTCTAGT 


GGTAAAATGC 


AAACCTTTCT 


GCTGCGATAG 


CATTAGGCAG 


2451 


AGGTTTCTCA 


CCACGCTGGA 


GCAGCTTGGG 


TTAGAATCAA 


AGCGTGTTGA 


2501 


TCTCTTGCCT 


CTGATTCTTT 


TCAATCACGA 


CCATATGCAA 


GCCTATTCCT 


2551 


TGATGGATAT 


TAGTTTGGAC 


ACATTCCCTT 


ATGCTGGAAC 


TACCACTACC 
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2 fini 

£• \J \J X 




TCT ACATGG G 


AGTTCCATGT 


9 i 


AC ATG C TC AT 


AATGTTGGTG 


TCAGTCTTCT 


z / u j. 


ACCTGGTTGC 


CAAAAATGAG 


GATGAGTATG 


■<£ / O J. 


GCCTCTGATG 


TCACAGCTCT 


TTCTAAATTG 


9 rat 


AATG G CTG G.s 


TCTCCTGTTT 


GTAATGGTCC 


2851 




TPPP & TV rri/-> 

1 LuoAA 1 A, 1 


TGGAAAAAGT 


2901 


TCCTTAAGGC 


nAATfSflAAAT 


^ PfO O 7\ T\ 7. 7\ 7, 


2951 


AATCTCAAAA 


n a p"T v rr*pp a p* 


v- A I UAAGAGT 


3 001 




t- AA(j(jC laat 


GGTTCTGCTC 


3 051 




CGCAGCTCTC 


AAAGAGAATG 


3101 


CAGCAAATCG 


AGCTGCTGCG 


AAATGCCGGC 


3 151 


GAAAAGGTGT 


GAAAGAAAGA 


GTCGATGAGC 


3 2 01 


AAGACAATAG 


GAACTAGACT 


TT AG ATTAC T 


3251 


AGTAAAACCA 


GCTCTTTCTT 


TTGTTGTATC 


3301 


CTTTACATGA 


TTCTTGGGAA 


GTCGTTAGGT 


3351 


TCTTCTCATT 


TGAGAGATCA 


AGTTGTTGTG 


3401 


CTTTTTAGGA 


TGTTTTCATG 


TGTTGGATTT 


3451 


ATAGTTATAG 


AAAGCTTTTC 


GGTGCC 
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GTTACAATGG CTGGCTCAGT 

CACTAAAGTT GGATTAGGAC 

TTCAGCTATC TGTTGATCTA 

AGAATGAGTC TCCGGGATCT 

TTCCTTTGCT GTTGGTCTTG 

ACTGCAAAGG TGAAGTGCCG 

A (spy- 5) 
GAGGTCCATG ATGATCCCTT 

CAGCGTTACT GGAGAAGCCA 
CTGTACCTTC CTCTTTACCA 
GACTCCACTA G C TAG ATAAC 
AGAGAGTCTT GACCCATCTG 
TTTTCCTGCT ATTTACTTCC 
GCTTGTGTAG TAAAAAGAAT 
TCTTTCTACT CTTAGTTTAG 
GGTAG TGGAT TTGGAGTTTT 
TATCGATTAG GGTTTTAAGG 
TGACTCATAT G AT AG T AAAT 
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FIGURE 4 A 
Sequence of the SPY Genomic Region 

1 TCTAGACTAG TTTCATAGTC CATGAAAAAA ACATCAAATC TCCTAAATGG 

51 CTGGACATAA TTCAGATGAT TTTGTATGAA ATAAAACATA AAACATATAT 

101 TTTCTTGCAT ATCTGGAGAT TTTTGTTTCT TTATTACAAT GACTAATTAA 

151 TTTACCTTGT TCCTTTTTAT TTTTGCAAAA TGATTACTGT TATCTATTTT 

201 GTCCGTTTCT AiYT,\AAATAA AATJACAATC TGAGCTGTGG AAAGAAAAAA 

^51 AAAGGAAAAG CGAGGAGAGC TAATCTGAGT CGTTGCCTCT GTCCACTGGC 

301 TCGGACCGAC CTCGTACCTC TTTCATTTGT CCTCGAGCTT ATAATAGACG 

351 TACGCCGTCG CCGCCACCTC CGCTCCATCC ATTCACGGCC AAGCGACGAC 

401 TCCACCGCTT AGGCTTGGCG TCTGAGGTAT ACTGATCAAC GCTTTCTGTT 

451 TTCGGAGGAG CGAGGCGAGA TCAGCTTCTC TTATGTCTAC TCAAACATAT 

501 CATTCTTCTT TAGCCACTCG ATTTCTTCTC TAGCGGCTCC AGAGGTTTCG 

551 TCGCTACAGA GTCAAGTTCC TCTTTTCAGG TTTTGTGGTA AGTAATCGTT 

601 AAACCCTAAG TATCGGACCT TGTTGTTTAA TCTGTTCGTG TTTACTCTCA 

651 ATTACATATG CATTCTTCTG CTTAATCGTT TCTTTTAGTT TAATTTCTAG 

701 GGTTTACATC CCAAAGGTCT GATCTTTTTG CATATTTGTG TGAATCTTAG 

751 TTTTTTTTTT TTTTTTTGGA ATTGAATGTG ATGAGTTGGG TTTGATACTG 

801 TTAAAGATCA AATCTTTAGC TTCGTTGAAG CTTCATATTT ATC-TCAACAA 

851 TGCAAGGTTT ATTTTTCTTC CACCTTTGAT TTGATATTTA TAATTGTTTC 

901 TTTGAAGGTG AACAAGATTT TAG TT AC AAA AAAATGGTGG GACTGGAAGA 

951 TGATACTGAG AGAGAGAGGT CACCAGTCGT AGAGAATGC 7, TTT^CCAATG 

1001 GGTCTCGGTC TTCTTCTTCT AGCGCAGGTG TTTTGTCTCC ATCACGAAAG 

1051 GTCACTCAGG GGAACGATAC ACTTTCTTAT GCCAATATTC TTCGGGCAAG 

1101 AAACAAGTTT GCTGATGCGC TTGCTCTCTA TGAGGCTATG CTGGAGAAAG 

1151 ATAGCAAGAA TGTTGAAGCT CACATTGGAA AAGGGATATG CTTGCAGACG 

1201 CAGAACAAAG GGAATCTAGC TTTCGATTGT TTTTCTGAAG CGATCAGGTT 

1251 GGATCCGCAT AATGCTTGTG CCCTTACACA C TGTGG TATA CTTC£ TAAAG 

1301 AAGAAGGACG CCTCGTAGAA GCTGCTGAGG TGCAACATTA C ATT A CCTTC 

13 51 TATCTGTGAT GATTTGCATT AG AGGGTG CT GCATTAGTTA GACCATTGAA 

1401 CTTGTTAAAT TGGTGATATG CAATTATGCA TTAGGTTTT* TGCTAGGTAA 

1451 TCAGTTTCTA ACGATTAATC ATCATATTTT GCACAGTCC7 ACCAGAAAGC 

15 01 ATTGATGGCA GATGCATCAT ACAAGCCAGC AGC AG AGTG T TTAGCCATTG 

1551 TTTTG AC CG A CCTTGGAACT AGCCTGAAGC TGGCTGGG- - ^ACTCAGGAA 

1601 GGAATTCAAA AGTATTACGA AGCCCTTAAG ATTGACCCAC PCTATGCTGT 

1651 AATTTTCTGT TCCTCTACCA TTTCACACTC TTGGTACCAT TTAACTGATT 

1701 CTCTAATTCA GTATGTTATA ATATATTTAT GCAGCCTGC- ^ATTACAACT 

HI} TAGGTGTTGT ATACTCCGAA ATG ATG CAAT ATGACAATGC CTTGAGCTGC 

1801 TACGAGAAGG CTGCACTTGA GAGGCCTATG TATGCTGAAG CATATTGTAA 

1851 CATGGGTGTC ATTTATAAGA ACCGTGGTGA CTTGGAGATG GCAATCACTT 

1901 GTTATGAGAG GTAGCATATC TGTTAATTCA TCTCTAACTG TTGACTGGTT 

1951 TCTTGCTACT TTGTTGAACG TGCAAGTAAG GCGCTGATT- TTTTC TCTTC 

?SSi rj?^TS CC TTT - GA TGTC TAGCTGTGTC TCCAAACTT7 SSSSS 

??nJ ^£ ACAATAT GGCCATAGCT CTGACAGATT TAGGAACAAA GGTAAGAATC 
G ™- AATTrT ATCACAATTT ATAACTCAAG TATACTTTTT GTAAGGGGCG 

III, ^^CTGGA.A AATTCGTTAT AAAACTTCGT TTTGTTTAGC TCCCCTTTGT 
l^S^l Q CTTTGTACTT ATGTCACGGC AATGGCATTG GwSrtGTTT 

III, ™TT^ TTA CTAGTGAACT TTTGCGCTGA ATAATTTTGA TTTGCAGTTT 
^^ CCTT CTTTTCCATT GGCGAGAAGC TGTTCAGCTG TGAGTACATC 

III} TS^r GTCA AATGTCAATG ATATTTCAGG TTAAAC TTG A AGGCGATGTA 

IJSi ^^ AAGGAG TGG CATATTA C AAGAAGG CT CTCTATTATA ACTGGCACTA 

lil} ^o A S ATGCT ATGTACAATC TTGGGGTGGC TTATGGAGAA ATGCTAAAGT 

2501 TCGACATGGT ATTTAATTTG TGATTTGTTC ATTTCTGTA a GTCAGTAATG 
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2 551 GTGTGGTTGT TATCG CGTGT TTATCCTTTC CTCGCCACTT 7ACTCGCTTG 

2 601 ATAAAATG AT ATATATCTTG ACTAGTTTAT CTACCTAGAT 777TATCCTT 

2 651 CTCCACATGT TCTCGTAATT AATCCAAAAC TCTGTATG7A GA7CTCTATA 

2701 TTATAATGGA ATTGTAGAGC CAAAGAATGA AATATGTCTG 73 3TCATGAT 

2751 TGCATTCTCA ATGTGCAGGC GATTGTCTTC TATGAGCT7G 3777CCACTT 

2 801 T AATC C AC AT TGTGCTGAGG C TTG C AA C AA TTTGGGAGTA 377TACAAAG 

2 851 ACCGTGACAA CCTTGATAAA GCTGTGGAGT GTTATCAGGT ;_\7ATTTTTG 

2 9 01 CAGATATCTG TAGCGTTTCA TCAGAATTTC ATTGTGTTTG G7GGCTTATT 

2 951 ATATCTCCCA ACCTATGTAG ATGGCTCTAT CAATCAAACC AA-ATTTTGCA 

3 001 CAGTCGCTTA ATAACCTTGG TGTCGTCTAT ACAGTCCAGG 7 7 7G AT AT AT 
3 051 ATTAAGGGCG GCTTAATGTT TTCTTAATTG AATCTCCTAA G7CGATAGAA 
3101 TGCCAATTCC TCTGATATTA CAGGGGAAAA TGGATGCTGC 7GCCAGCATG 
3151 A.TTGAGAAGG CTATACTTGC TAATCCCACA TATGCAGAAG C7TTTAACAA 
3201 CCTAGGTCTG TTTTCTCATC TTCTGTTCTT TACGAGCTTC CTCACGTGTT 
3 251 ACAACTGCTT AG AAAC TATA TTCCTTTGAA ATTTAGATTT 7ATGTTTGTC 
3301 CTTTTGTTTC TACCTCCCTG GCGCTAAGAG TCTTGTAGTG 7CTGTGATAA 
3 351 CCAGTTTCAT GGTGCGATTC AAATGTAGGT GTTCTTTACA GAGATGCTGG 
3 401 AAATATAACT ATGGCTATCG ATGCTTATGA GGAATGCCT7 .-.-.GAT AG ATC 
3 4 51 CAGATTCTCG CAATGCTGGC CAGGTATCTA TACTTTAGCG 7GGTCTTCTT 
3 501 GTTATGAGGT TGAAAGATAT ATGTGTTTAA AACCTTCTTG 7CCCCTTTTG 
3 5 51 TAGAACCGAT TG CTTG C CAT GAATTACATA AATGAAGGAC 7CGATGACAA 
3 601 ACTATTTGAG GCTCACAGGT AAGCACATAT ATTATTAATG TAGATTTGTA 
3 651 TTATGTTGCT T7TATGGGTC TTACAGT7GAA AAAATCTTC 7 GAACACAGAG 
3 7 01 ACTGGGGTTG GCGCTTCACA AGATTACACC CTCAATACAC 773ATGGGAT 
3 751 AATCTGAAAG ATCCAGAGCG ACCTATCACC ATTGGATATA 7 7 7C A C C AG A 
3 801 TTTCTTCACT CATTCAGTAT CTTATTTCAT TGAAGCTCCC 377ACGCATC 
3 851 ATGATTATAC AAAGTACAAA GTGGTGGTTT ATTCAGCGG7 A37TAAGGTA 
3 9 01 GGATTTTTAC CTATATAACT TATATAGATA CATTTTCCC7 37AA.GCAATT 

3 9 51 CATTTCCTGG TTCTCGTGGC ATTTTTCCCT TTTGAGCAA7 CA7TGGTCTC 

4 001 TCATGGCTTT GCAGGCAGAT GCAAAAACAT ACAGGTTTAG G3ATAAAGTG 
4 0 51 TTGAAGAAAG GTGGAGTTTG GAAGGATATA TACGGGATA1- A7 3AGAAAAA 
4101 GATAGCAAGT ATGGTCAGAG AGGACAAAAT CGACATTTTC- G 7 AGAACTCA 
4151 CTGGTCATAC GGCAAACAAC AAGTTGGGAA CAATGGCCTG 7 A 3 A C C AG C A 
4 201 CCGGTTCAGG TGAGAGGATA TATTAAACCT ATC TC A TTT 7 3773TTTCGG 

42 51 GTTTTGCCTT TGACTTTCCA TTTCAAGTGT ACTTATAT7 3 3 37AAGATAC 

43 01 CAGGTTACTT GGATCGGCTA TCCAAATACT ACGGGTTTGC 3 3A3TGTTGA 
4 3 51 TTACAGAATT ACAGATTCGT TGGCTGATCC ACCAGATAC3 AJAACAAAAGT 
4401 ACGTTTTGGT TCAAGATGCA ATTTTGGGTT TCGGAAGTG3 7 3 3 AAA T AAA 
4451 AA.TCTTAATT TTTATTTATT TATTTTGTGA TATTTGATTG C A3 3CAGGTC 
4 501 GAGGAGCTGG TTAGGCTTCC GGACTGTTTT CTTTGTTATA 3A33TTCCCC 
4 551 AGAGGCTGGT CCTGTTTGTC CAACACCTGC GCTTTCTAA7 3 3 37TTGTCA 
4 6 01 CATTTGGTAG TTTCAACAAC CTCGCAAAGG TTAAAAAA77 7 37GTCCTTG 
4 651 GATTATGCAC ACCAATCTCC CCTAGTATCT CTTTCAATG7 777GACAGGT 
47 01 TTATCTCTGT TTGTG C AAAT CAGATAACTC CTAAGGTGC7 33AAGTGTGG 
47 51 GCTAGGATAC TGTGTGCAGT TCCCAATTCT CGTCTAGTG 3 7 A-AAATG CAA 

4 801 ACCTTTCTGC TG CG AT AG C A TTAGGCAGAG GTTTCTCAC7 A3 3CTGGAGC 
4851 AGCTTGGGTT AGAATCAAAG CGTGTTGATC TCTTGCCTT7 G A 7 TC TTTTC 
49 01 AATCACGACC ATATGCAAGC CTATTCCTTG ATGGATATTA G 3 7 AAG ATTT 
4951 GACACATAGT GCTCTGTAAA ACACCGAGGC TTATAGATTC A3ATATTTAA 
5001 TTTACATTTA TTGCAGTTTG GACACATTCC CTTATGCTGG A-ACTACCACT 

5 051 ACCTGTGAGT CTCTCTACAT GGGAGTTCCA TGTGTTACAA 7GGCTGGCTC 
5101 AGTACATGCT CATAATGTTG GTGTCAGTCT TCTC AC T AAA G 7 J GGTAAGC 
5151 TCTTAGCAAA ATTTTTTTTT TTTTTTTTGC AAAAATTGT7 GTTAGTCGAC 
5201- ATCTTTTAGC TAATTCAGCC ATTTCTTGAT TCAGGATTAG GA~ACCTGGT 
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52 51 TGCCAAAAAT GAGGATGAGT ATGTTCAGCT ATCTGTTGAT CTAGCCTCTG 

53 01 ATGTCACAGC TCTTTCTAAA TTGAGAATGA GTCTCCGGGA TCTAATGGCT 

53 51 GGATCTCCTG TTTGTAATGG TCCTTCCTTT GCTGTTGGTC TTGAATCCGC 

54 01 ATATCGGAAT ATGTGGAAAA AGTACTGCAA AGGTGAAGTG CCGTCCTTAA 
5451 GGCGAATGGA AATGCTGCAA AAAGAGGTCC ATGATGATCC C 7TAATCTC A 
5501 AAAGACTTGG GACCATCAAG AGTCAGCGTT ACTGGAGAAG CCACTCCGTC 

55 51 TCTCAAGGCC AATGGTTCTG CTCCTGTACC TTCCTCTTT.-, CCAACCCAAT 
5601 CCCCGCAGCT CTCAAAGAGA ATGGACTCCA CTAGCTAGAT AACCAGCAAA 
5651 TCGAGCTGCT GCGAAATGCC GGCAGAGAGT CTTGACCCAT CTGGAAAAGG 
57 01 TGTGAAAGAA AG AG TCG ATG AGCTTTTCCT GCTATTTAC? 7CCAAGACAA 
5751 TAGGAACTAG AC TTT AG ATT ACTGCTTGTG TAGTAAAAAG AATAGTAAAA 
5801 CCAGCTCTTT CTTTTGTTGT ATCTCTTTCT ACTCTTAGTT TAGCTTTACA 
5 851 TGATTCTTGG GAAGTCGTTA GGTGGTAGTG GATTTGGAGT TTTTCTTCTC 
5901 TiTTTGAGAGA TCAAGTTGTT GTGTATCGAT TAGGGTTTTA AGGCTTTTTA 
5951 GGATGTTTTC ATGTGTTGGA TTTTGACTCA TATGATAGTA AATATAGTTA 
6001 TAGAAAGCTT TTCGGTGCCT TTACCTATTT CATAATATAA TTATCTAAAA 
6051 CTCCTGCTTA AGCTTAATCC CATAGGTGAG ACCCAATGAA AGACTTTTGA 
6101 CTTGTATGAT TATGTTGCCA ACTCCATATC TCTCTTTAAT TAATTAATAT 
6151 GGAGAGAGTA AAAAGGCAAG CAACTAACTA CTCTTCAATT CAACACTTCT 
6201 TCTGCTCCAT TGCTTCCACC AGAACCCTAA AGACCTTATA AAACCCAATC 
6251 CCAAATCCCT TTGCTCATTC ACCACCAAAG CATCCAACT? TCTTTGCTTC 
6301 CTCTTTCAAA G C AAC AC AAA ATATTAAATC TCAAGAAACA AAGTAAAGAA 
6351 AAATGAAGAA CGAATCTACC TTCATTGATG TCCCTGCAGA ATCCAGCTCA 
6401 GCCATGAAAG GCAAAGCTCC TCTAATCGGT GTAGCAAGAG ACCACACTAC 
6451 TAGTGGCTCA GGTGGAT 
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FIGURE 6 A 



Expected T-DNA Structure 
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FIGURE SB 

Structure of T-DNA insert in SPY 
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